Musical and Phonetic Controls in a Singing Voice Synthesizer
نویسنده
چکیده
have directed the project in Hamamatsu (Japan) and have visited us regularly in Barcelona. The rest of the people in the Music Technology Group have also to be mentioned, they have been always available to give us a hand when needed. In addition, I want to express my gratitude to Luis Vergara who has directed this thesis from the Polytechnics University of Valencia. Barcelona) in collaboration with Yamaha Corporation. The Music Technology Group, MTG, is a research group working on signal processing techniques for musical production and for other multimedia applications. Apart from pursuing the development of spectral audio models, MTG is dedicated to sound models for synthesis, the processing of audio based content and other issues related to Music Technology. On the other hand, Yamaha Corporation manufactures all kinds of musical instruments and professional audio equipment for professionals and amateur enthusiasts. From its base in Hamamatsu City, southwest of Tokyo, the company is also a leading producer of audiovisual products, semiconductors and other computer-related products, electronic equipment and specialty metals. The aim of the Daisy project is to synthesize a singing voice from a musical score. That is to say, from a given musical melody and a given lyrics in a particular language (English and Japanese in our case) our goal is to obtain an output sound as though a real singer was performing a song. Of course, this is not an easy thing, to say the least. However, with the accumulated knowledge in different fields, the use of new technologies and the increasing power of computers, this objective has become achievable nowadays. The artistic and technical disciplines relevant to this project cover an impressive variety of fields: sound recording and reproduction, music performance, music perception, phonetics, computer programming, digital signal processing… We can say that this is a really multidisciplinary enterprise. This research project presented here is a continuation of an automatic singing voice impersonator application for karaoke developed by the Music Technology Group [Cano, Loscos, Bonada, de Boer, Serra, 2000]. That system morphed in real time the voice attributes of a user (such as pitch, timbre, vibrato and articulations) with the ones from a prerecorded singer. 2 Because of my education as a musician and as an engineer, I have always been willing to work in an area in which I could apply my knowledge in both fields. And from the first time I heard about the …
منابع مشابه
Performance-driven Control for Sample-based Singing Voice Synthesis
In this paper we address the expressive control of singing voice synthesis. Singing Voice Synthesizers (SVS) traditionally require two types of inputs: a musical score and lyrics. The musical expression is then typically either generated automatically by applying a model of a certain type of expression to a high-level musical score, or achieved by manually editing low-level synthesizer paramete...
متن کاملImprovements to a Sample-Concatenation Based Singing Voice Synthesizer
This paper describes recent improvements to our singing voice synthesizer based on concatenation and transformation of audio samples using spectral models. Improvements include firstly robust automation of previous singer database creation process, a lengthy and tedious task which involved recording scripts generation, studio sessions, audio editing, spectral analysis, and phonetic based segmen...
متن کاملMandarin Singing Voice Synthesis Based on Harmonic Plus Noise Model and Singing Expression Analysis
The purpose of this study is to investigate how humans interpret musical scores expressively, and then design machines that sing like humans. We consider six factors that have a strong influence on the expression of human singing. The factors are related to the acoustic, phonetic, and musical features of a real singing signal. Given real singing voices recorded following the MIDI scores and lyr...
متن کاملReal-time CALM Synthesizer: New Approaches in Hands-Controlled Voice Synthesis
In this paper, a new voice source model for real-time gesture–controlled voice synthesis is described. The synthesizer is based on a causal-anticausal model of the voice source, a new approach giving accurate control of voice source dimensions like tenseness and effort. Aperiodic components are also considered, resulting in an elaborate model suitable not only for lyrical singing but also for v...
متن کاملSample-based singing voice synthesizer by spectral concatenation
The singing synthesis system we present generates a performance of an artificial singer out of the musical score and the phonetic transcription of a song using a frame-based frequency domain technique. This performance mimics the real singing of a singer that has been previously recorded, analyzed and stored in a database. To synthesize such performance the systems concatenates a set of element...
متن کاملSynthesis and Processing of the Singing Voice
As soon as the beginning of the 60Õs, the singing voice have been synthesized by computer. Since these first experiments, the musical and natural quality of singing voice synthesis has largely improved and high quality commercial applications can be envisioned for a near future. This talk gives an overview of synthesis methods, control strategies and research in this field. Future challenges in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001